Imitation Learning of Hierarchical Driving Model: From Continuous Intention to Continuous Trajectory
نویسندگان
چکیده
One of the challenges to reduce gap between machine and human level driving is how endow system with learning capacity deal coupled complexity environments, intentions, dynamics. In this letter, we propose a hierarchical model explicit models continuous intention dynamics, which decouples in observation-to-action reasoning data. Specifically, module takes perception generate potential map encoded obstacles intentions. Then, regarded as condition, together current trajectory output by function approximator network, whose derivatives can be used for supervision without additional parameters. Finally, our method validated both datasets stimulation, demonstrating that has higher prediction accuracy displacement velocity generates smoother trajectories. Our also deployed on real vehicle loop latency, validating its effectiveness. To best knowledge, first work produce using network. code available at https://github.com/ZJU-Robotics-Lab/CICT.
منابع مشابه
Continuous-Time Hierarchical Reinforcement Learning
Hierarchical reinforcement learning (RL) is a general framework which studies how to exploit the structure of actions and tasks to accelerate policy learning in large domains. Prior work in hierarchical RL, such as the MAXQ method, has been limited to the discrete-time discounted reward semiMarkov decision process (SMDP) model. This paper generalizes the MAXQ method to continuous-time discounte...
متن کاملBetween Imitation and Intention Learning
Research in learning from demonstration can generally be grouped into either imitation learning or intention learning. In imitation learning, the goal is to imitate the observed behavior of an expert and is typically achieved using supervised learning techniques. In intention learning, the goal is to learn the intention that motivated the expert’s behavior and to use a planning algorithm to der...
متن کاملLearning a language model from continuous speech
This paper presents a new approach to language model construction, learning a language model not from text, but directly from continuous speech. A phoneme lattice is created using acoustic model scores, and Bayesian techniques are used to robustly learn a language model from this noisy input. A novel sampling technique is devised that allows for the integrated learning of word boundaries and an...
متن کاملContinuous Spatiotemporal Trajectory Joins
Given the plethora of GPS and location-based services, queries over trajectories have recently received much attention. In this paper we examine trajectory joins over streaming spatiotemporal data. Given a stream of spatiotemporal trajectories created by monitored moving objects, the outcome of a Continuous Spatiotemporal Trajectory Join (CSTJ) query is the set of objects in the stream, which h...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE robotics and automation letters
سال: 2021
ISSN: ['2377-3766']
DOI: https://doi.org/10.1109/lra.2021.3061336